Most people infected with COVID-19 experience mild to moderate respiratory illness and recover without requiring special treatment. However, people of any age can become seriously ill or die. The dataset for this project comes from the CDC (Centers for Disease Control and Prevention) and reports provisional COVID-19 deaths in the United States by date, state, sex, and age group.
The objective of this project is to answer the following questions:
Do COVID-19 death cases decrease over time?
Do COVID-19 death cases vary by state, sex, and age group?
The first chart is a line plot of Covid-19 death cases by date, covering January 2020 to November 2022. Each color in the plot represents a different state, so the chart also shows how deaths are distributed across states over time. From 2020 to 2022, the overall number of deaths first increases and then decreases. The data peaks in early 2021, and the most recent figures (end of 2022) are much lower than those from 2020, when the virus first began to spread. This suggests that deaths from Covid-19 have been greatly reduced. Looking at individual states, the area with the largest number of deaths in 2020 is New York City. The blue lines, which correspond to New York State and its vicinity, sit at the highest levels during that period. In 2021, California's numbers rose rapidly and reached the highest peak in the data. At the end of 2021, California's numbers decline, and the highest peaks shift to Florida and Texas.
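The aggregation behind a chart like this can be sketched in a few lines. The example below is only an illustration under stated assumptions: the row layout (month, state, deaths), the function names, and every number are hypothetical and are not taken from the CDC dataset or from the report's own code. It sums deaths per month to find the overall peak and reports which state leads in each month.

```python
from collections import defaultdict

# Hypothetical rows: (month, state, covid_deaths).
# All values are invented for illustration, NOT real CDC figures.
rows = [
    ("2020-04", "New York City", 900),
    ("2020-04", "California", 200),
    ("2021-01", "California", 1500),
    ("2021-01", "New York City", 300),
    ("2021-09", "Florida", 1100),
    ("2021-09", "California", 400),
]


def monthly_totals(rows):
    """Sum deaths across all states for each month, sorted by month."""
    totals = defaultdict(int)
    for month, _state, deaths in rows:
        totals[month] += deaths
    return dict(sorted(totals.items()))


def leading_state_by_month(rows):
    """Return the state with the most deaths in each month."""
    best = {}
    for month, state, deaths in rows:
        if month not in best or deaths > best[month][1]:
            best[month] = (state, deaths)
    return {month: state for month, (state, _deaths) in sorted(best.items())}


totals = monthly_totals(rows)
peak_month = max(totals, key=totals.get)  # month with the highest overall total
leaders = leading_state_by_month(rows)
```

With real data, `totals` would feed the overall trend line and `leaders` would show how the leading state changes over time.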
The second chart is a bar chart of death cases by age group and sex. The number of death cases rises clearly with age. The bar colors indicate sex: blue represents males, and red represents females. Except for the two age groups 55-64 and over 85, the groups have more male deaths than female deaths.
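A grouped comparison like the one in this bar chart could be computed as follows. This is a minimal sketch: the record layout, the age-group labels, and all counts are hypothetical placeholders chosen for illustration, not values from the dataset.

```python
from collections import defaultdict

# Hypothetical records: (age_group, sex, covid_deaths).
# Counts are invented for illustration, NOT real CDC figures.
records = [
    ("45-54", "Male", 120), ("45-54", "Female", 80),
    ("65-74", "Male", 400), ("65-74", "Female", 310),
    ("85+", "Male", 450), ("85+", "Female", 520),
]


def deaths_by_age_and_sex(records):
    """Build a nested mapping: age group -> sex -> total deaths."""
    table = defaultdict(lambda: defaultdict(int))
    for age_group, sex, deaths in records:
        table[age_group][sex] += deaths
    return {age: dict(by_sex) for age, by_sex in table.items()}


table = deaths_by_age_and_sex(records)

# Age groups in which male deaths exceed female deaths.
male_higher = [
    age for age, by_sex in table.items()
    if by_sex.get("Male", 0) > by_sex.get("Female", 0)
]
```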
For the analysis by sex, the next figure is a histogram of death cases colored by gender. The figure shows only observations with fewer than 250 death cases, because most observations in the dataset fall below that value. Restricting the range makes the distribution by gender easier to read. The histogram shows that, among observations with fewer than 250 death cases, male counts generally exceed female counts. Combining the visualization results above, we can conclude that the Covid-19 death data from 2020 to 2022 contains more male deaths than female deaths.
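The filtering and binning step behind such a histogram might look like the sketch below. The 250 cutoff comes from the text; the bin width of 50, the function name, and the sample observations are assumptions made purely for illustration.

```python
from collections import Counter

THRESHOLD = 250  # cutoff stated in the text
BIN_WIDTH = 50   # assumed bin width, chosen only for this example

# Hypothetical observations: (sex, covid_deaths).
# Values are invented for illustration, NOT real CDC figures.
observations = [
    ("Male", 30), ("Female", 20),
    ("Male", 140), ("Female", 110),
    ("Male", 240), ("Female", 900),
    ("Male", 1200),
]


def binned_counts(observations, threshold=THRESHOLD, bin_width=BIN_WIDTH):
    """Count observations below the threshold per (sex, bin start)."""
    counts = Counter()
    for sex, deaths in observations:
        if deaths < threshold:
            bin_start = (deaths // bin_width) * bin_width
            counts[(sex, bin_start)] += 1
    return counts


hist = binned_counts(observations)
```

Each key of `hist` is a pair of sex and the lower edge of a bin, so the counts for the two sexes can be drawn side by side or stacked.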
This plot shows the relationship between the number of Covid-19 deaths and the total number of deaths. The two variables are clearly correlated: as total deaths increase, the number of Covid-19 deaths grows almost linearly. Because the plot is interactive, it also helps identify which states the outlying data points belong to.
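One way to quantify the near-linear relationship described here is the Pearson correlation coefficient together with a least-squares slope. The sketch below uses invented numbers and hypothetical function names; it is not the report's own analysis.

```python
import math

# Hypothetical paired values, invented for illustration (NOT real CDC figures).
total_deaths = [1000, 2000, 3000, 4000]
covid_deaths = [100, 210, 290, 400]


def pearson_and_slope(xs, ys):
    """Return (Pearson r, least-squares slope of y on x)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    r = sxy / math.sqrt(sxx * syy)
    slope = sxy / sxx
    return r, slope


r, slope = pearson_and_slope(total_deaths, covid_deaths)
```

A value of `r` close to 1 would support the "close to linear" reading, and `slope` estimates how many additional Covid-19 deaths accompany each additional total death in the sample.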
During these three years, the number of Covid-19 deaths fluctuated, increasing until the beginning of 2021 and declining significantly thereafter. Across states, California, Texas, and Florida have the three highest totals of Covid-19 deaths, and New York City, California, and Florida each had the highest level among states at different times. Covid-19 death cases increase with age, with older age groups having more deaths. In terms of gender, both the number and the proportion of Covid-19 deaths are higher for males than for females in most groups.
Copyright © 2022, Chen Chen.